Skip to content

Update vllm-ipex, boost serving performance#1941

Merged
ZePan110 merged 1 commit intoopea-project:mainfrom
Johere:dev/update-vllm-ipex
Nov 17, 2025
Merged

Update vllm-ipex, boost serving performance#1941
ZePan110 merged 1 commit intoopea-project:mainfrom
Johere:dev/update-vllm-ipex

Conversation

@Johere
Copy link
Copy Markdown
Contributor

@Johere Johere commented Oct 31, 2025

Description

Update vllm-ipex, boost serving performance

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Others (enhancement, documentation, validation, etc.)

Dependencies

n/a

Tests

Test scripts:
LLM: tests/llms/test_llms_text-generation_service_vllm_ipex_on_intel_arc.sh
LVM: tests/lvms/test_lvms_vllm_ipex_on_intel_arc.sh

@Johere
Copy link
Copy Markdown
Contributor Author

Johere commented Nov 3, 2025

@joshuayao can you help on these failure cases? May not from my side, thanks!

@joshuayao
Copy link
Copy Markdown
Collaborator

@joshuayao can you help on these failure cases? May not from my side, thanks!

Hi @ZePan110 could you please check whether these CI failures are known issues? Thanks.

@joshuayao joshuayao added this to OPEA Nov 10, 2025
@joshuayao joshuayao added this to the v1.5 milestone Nov 10, 2025
@ZePan110
Copy link
Copy Markdown
Collaborator

ZePan110 commented Nov 11, 2025

@joshuayao can you help on these failure cases? May not from my side, thanks!

Hi @ZePan110 could you please check whether these CI failures are known issues? Thanks.

yes. For build failure, I created a PR #1943 to fix them. For bedrock, I created issue #1944 to track.

@joshuayao joshuayao self-requested a review November 12, 2025 07:48
@Johere
Copy link
Copy Markdown
Contributor Author

Johere commented Nov 17, 2025

@joshuayao Hi, it seems there are still failures in CI tests, could you help me to merge this PR?

@ZePan110 ZePan110 merged commit 6c33ec9 into opea-project:main Nov 17, 2025
83 of 96 checks passed
@github-project-automation github-project-automation bot moved this to Done in OPEA Nov 17, 2025
jilongW pushed a commit to jilongW/GenAIComps that referenced this pull request Dec 18, 2025
Signed-off-by: sunzhonghua2004 <137033036@qq.com>
jilongW pushed a commit to jilongW/GenAIComps that referenced this pull request Dec 18, 2025
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>
joshuayao pushed a commit that referenced this pull request Jan 5, 2026
* Fix build issues (#1937)

* Fix build issues
       Add docling in requirements.in
       Change pathway version to fix dependency conflict.

Signed-off-by: ZePan110 <ze.pan@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* Add Arbitration Post-Hearing Component with LLM-Based Entity Extraction (#1938)

* initial commit for arbitratory micro service

* test cases added

* test cases added

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* renamed test file as per review comment

* updated path of SCRIPT_DIR in to resolve microservice test

* resolved comments wrt license header and env configs in compose file

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* removed unused MODEL env variable Signed-off-by: Author Name <c.noeljaymon@zensar.com>

* removed space and added sign off

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* resolved ci issues for compose file name and readme reference paths

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>

* added arb_post_hearing_assistant-compose.yaml file in .github folder

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>

* deleted unused file redis-values.yaml

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* made service and image name same for arb_post_hearing_assistant-compose.yaml in .github folder

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>

* DCO remediation: adding missing Signed-off-by lines

Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>

* fixed the micro service build issue

Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>

* microservice container not found fixed

Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>

* removed airgap code

Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>

* fixed pre-commit check issue

---------

Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>
Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>
Co-authored-by: Ramesh Katkuri <rameshkatkuri@Rameshs-MacBook-Air.local>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Noel Jaymon <c.noeljaymon@zensar.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* add funasr paraformer asr service impl (#1914)

* add funasr paraformer asr service impl

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix requirements deps; modify ASR READMEs

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* add funasr-paraformer dockerfile in github workflow

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* enable HF_TOKEN to be defined in request (#1940)

Signed-off-by: wwanarif <wan.abdul.hakim.b.wan.arif@intel.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* fix the source of LibreOffice (#1942)

Signed-off-by: zhihang <zhihangdeng@link.cuhk.edu.cn>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* Fix CD issue and llms build failure (#1943)

* Fix permissions issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Fix issue

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Test

Signed-off-by: ZePan110 <ze.pan@intel.com>

* Revert "Test"

This reverts commit 1df372c.

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* update vllm-ipex, boost servi performance (#1941)

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* Add openGauss support to dataprep microservice and update related doc… (#1945)

* add openGauss support for dataprep

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix the healthcheck for openGauss

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>

* update README.md for openGauss

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>

---------

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* Add openGauss support to retrievers and update related doc… (#1949)

* add openGauss support for retrievers

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* update package version to fit B60

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* enable CnClip B/16&L/14 and enable flickr30kcn dataset

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* update docs about qwen2-vl &qwen2.5-vl & cnclip, add qwen-vl configs

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

* fix bug

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* fix bug

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* fix for doc

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* fix gradio verion

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* retrigger test

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

* added test for cnclip

Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>

---------

Signed-off-by: ZePan110 <ze.pan@intel.com>
Signed-off-by: sunzhonghua2004 <jilong.wang@intel.com>
Signed-off-by: Noel Jaymon <c.noeljaymon@zensar.com>
Signed-off-by: Ramesh <katkuri.ramesh@zensar.com>
Signed-off-by: wwanarif <wan.abdul.hakim.b.wan.arif@intel.com>
Signed-off-by: zhihang <zhihangdeng@link.cuhk.edu.cn>
Signed-off-by: sunshuang1866 <sunshuang1866@outlook.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: ramesh-katkuri <katkuri.ramesh@zensar.com>
Co-authored-by: Ramesh Katkuri <rameshkatkuri@Rameshs-MacBook-Air.local>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Noel Jaymon <c.noeljaymon@zensar.com>
Co-authored-by: LIU Lin <107393642+llin60@users.noreply.github.com>
Co-authored-by: wanhakim <wanhakim92@gmail.com>
Co-authored-by: zhihang <zhihangdeng@link.cuhk.edu.cn>
Co-authored-by: linjiaojiao <jiaojiao.lin@intel.com>
Co-authored-by: sunshuang1866 <sunshuang1866@outlook.com>
Co-authored-by: Xueshu Wang <xueshu.wang@intel.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants